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Protein L and Hybrid Proteins Thereof, 

&: 

The present invention relates to sequences of protein L 
which bind to light chains of inmunoglobulins. The 
5 invention also relates to hybrid proteins of protein L jV^. 
having the ability to bind to light chains of all Ig and *^<v 
also to bind to light and heavy chains of immunoglobulin PS^*" 
G, DNA-sequences which code for the proteins vectors k 
that contain such DNA-sequences, host cells transformed 
10 by the vectors, methods for preparing the proteins, 

reagent apparatus for separating and identifying immuno- ^J^* 
globulins, compositions and pharmaceutical compositions 
which contain the proteins. 

15 The invention relates in particular to the DNA-sequence KrW 

and to the amino acid sequence of the light-chain form- 
ing domains of protein L. 

Proteins which bind to the constant domains (of high /a^!* 
2 0 affinity) of the immunoglobulins (ig) are known* Thus, 

protein A (from Staohvlococcus aureus ^ (Forsgren, A. and 
Sjoquist, J. (1966) Protein A from St<iphylococcus v^J^' 
aureus. I* Pseudo-immune reaction with human gamma- *^i«(V- 
globulin. J. Immunol. 97: 822-827) binds to IgG from 

2 5 various mammal species. The binding of protein A to IgG ? J* 

is mediated essentially via surfaces in the Fc-fragment 

of the heavy chain of the IgG-molecule , although a u-^^vl 
certaxn bond is also effected with surfaces in the Fab- illAV 
fragment of the IgG. Protein A lac)cs the ability of 

3 0 binding to human IgG3 and neither will it bind to IgG 

from several other animal species, such as important 

laboratory animals, for instance rats and goats, which iiiS^- 
limits the use of protein A. 9mSm 

35 Protein G (Bj6rc)c, L. and Kronvall, G. (1984) Purifica- 

tion and some properties of streptococcal protein G, a 
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novel IgG-binding reagent. J. Immunol, 133: 969-974; 
Reis, K., Ayoub, E. and Boyle, M. (1984) Streptococcal 
Fc receptors. I. Isolation and partial characterization 
of the receptor from a group C streptococcus* J. 
Immunol. 132: 3091-3097) binds to heavy chains in human 
IgG and to all four of its stibclasses and also to IgG 
from most mammals, including rats and goats. 



15 break the bond with a weak agent, for instance when 



Protein M {Applicant's Patent Application PCT/SE 
20 91100447) binds to the Fc-fragment in IgG from humans, 

monkeys, rabbits, goats, mice and pigs. 



ST 



Protein H (Akesson, P., Cooney, J., Kishimoto, F. and 
10 Bjorck, L, (1990) Protein H - a novel IgG binding bacte- ^^^^ 

rial protein. Molec. Immun. 27: 523-531) binds to the 



Fc-fragment in IgG from htiman beings, monkeys and rab- ^© 

bits. However, the bond is weaker than in the case of j^A 
protein G and A, which may be beneficial when wishing to 

purifying proteins which are readily denatured with the ~j| 

aid of antibodies. JcVf' 



Protein L (Bjorck, L. (1988) Protein L, a novel bacteri- 
al cell wall protein with affinity to Ig L chains. J. &^ 
25 Immunol. 140: 1194-1197), which binds to the light 

chains in immunoglobulins from all of the classes G, A, ^'^^^K- 
M, D and E is known (USP 4,876,194). The amino acid se- l^K- 
quence and the binding domains of this protein, however, 
have hitherto been unknown. - 

30 

The aforesaid proteins can be used in the analysis, 
purification and preparation of antibodies and for 
diagnostic and biological research. i^^^ 

3 5 The elimination of immunoglobulins, with the aid of \^v* 
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plasmapheresis, can have a favourable effect on some 
autoimmune diseases. A broadly binding protein would be 
an advantage when wishing to eliminate all classes of 
antibodies in this context. 

It has long been known that infectious conditions can be 
prevented or cured with the introduction of an immune 
serum, i.e. a serum which is rich in antibodies against 
the organism concerned or its potentially harmful pro- 
duct. Examples hereof are epidemic jaundice, tetanus, 
diphtheria, rabies and generalized shingles. Antibodies 
against a toxic product may also be effective in the 
case of non-infectious occasioned conditions. Serum 
produced in animals against different snake venoms is 
the most common application in this respect. However, 
the administration of sera or antibody preparations is 
not totally without risk. Serious immunological reac- 
tions can occur in some cases. Singular cases of the 
transmission of contagious diseases, such as HIV and 
hepatitis through the agency of these products have also 
been described. In order to avoid these secondary ef- 
fects, it has been desirable to produce therapeutic 
antibodies in test tubes- A large number of novel tech- 
niques for the preparation of antibodies in test tubes 
have been proposed in recent years. Examples of such 
techniques are hybridom techniques, synthesis of chima- 
antibodies and the preparation of antibodies in bacte- 
ria. These techniques also enable antibodies to be 
specially designed which can fur-ther widen the use of 
such molecules as therapeutics, for instance in the case 
of certain tumour-diseases. In the case of some of these 
novel methods, however, the product totally lacks the 
Fc-fragment to which all of the described IgG-binding 
proteins, with the exception of protein L, bind. There 
is consequently a need of a process for purifying anti- 



SUBSTITUTE SHEET 



' m * »\ «^ 



m 

r 



V V , 




wo 93/22342 



PCr/SE93/00375 



bodies for therapeutic use, wherein proteins which have 
a broad binding activity/specificity, can be of value. 



m 



i 



It has long been possible to utilize the antibody reac- 
5 tion with its high grade specificity for diagnosing past 
or, in some cases, ongoing infections with different 
parasites. This indirect method of indicating infectious 
agents is called serology and, in many cases, may be the 
only diagnostic alternative. In certain cases, it can 

10 also be of interest to exhibit specific IgE- or IgA- 
antibodies. When diagnosing with the aid of serology, 
the antigen is most often fastened to a solid phase, 
whereafter serum taken from the patient is incubated 
with the antigen. Antibodies that have been bound from 

15 the patient can then be detected in different ways, 

often with the aid of a secondary antibody (for in- 
stance, an antibody which is directed against the light 
chains of human antibodies) to which an identifiable 
label has been attached, such as alkaline phosphatase, 

20 faiotin, radioactive isotopes, fluorescein, etc. In this 

context, a protein having a broad Ig binding capacity 
can be used as an alternative to secondary antibodies. 

There are a number of non-therapeutic and non-diagnostic 
25 reasons for the necessity to bind antibodies. Antibodies 

are often used in research, both for detection and for 
purifying the antigen against which they are directed. 
All techniques which facilitate the purification of 
antibodies and, in particular, techniques which enable 
30 different classes to be purified, are of interest in 

this context. 



m 
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35 



Consequently, there is a serious need of a protein which 
has a broad binding activity/specificity and which binds 
to several different classes of immunoglobulins from 
different animal species. At present, there is no known 
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protein which will bind to all immunoglobulin classes. 
The earlier known proteins A, G, H and M bind only to 
heavy chains in IgG- The )cnown protein L (Bjdrck et al, 
1988) binds to the light x-c^^ins and 7-chains in im- 
5 munoglobulins of all classes, although the bonds are 

much weaker on the x-chains. Applicant has charted pro- 
tein L, has determined the amino acid sequence for 
protein L, has identified the light-chain binding do- 
mains on protein L, and has used these to produce hybrid 

10 proteins which possess the IgG-Fc-binding domains of 

protein G. The Applicant is able to show through protein 
LG that a protein of broader binding activity/ 
specificity can be produced thereby. The aforesaid 
proteins A, G, H and M bind to the same surfaces, or to 

15 very closely lying surfaces on IgG-Fc. The protein L 
which binds to light chains can thus be combined with 
any other functionally similar protein which binds to 
the Fc-fragment of heavy chains. A similar broadening of 
the Ig-binding activity is achieved with all alterna- 

20 tives. 

Thus, the present invention relates to the sequence of 
protein L which binds to light chains in Ig and has the 
amino acid sequence disclosed in Figure 1, and variants, 
25 subf ragments , multiples or mixtures of the domains B1-E5 

having the same binding properties. The invention also 
relates to a DNA-sequence which codes for such protein 
sequences, for instance the DNA-sequence in Figure 1. 

3 0 The invention is concerned with a hybrid protein which 

is characterized by comprising domains which bind to the 
light X"C^2iins and X-chains in immunoglobulins of all 
classes, and also comprises domains which bind to heavy 
chains in immunoglobulin G, wherein those domains which 

3 5 bind to the light chains are chosen from among the B1-, 

B2-, B3-, B4- and B5-domains in protein L (see Claim 1) 
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and those domains which bind to heavy chains of immuno- 
globulins are chosen from the C1-, C2- and C3-domains in 
protein G; the B- and Cl-domains from protein H; the 
B1-, 82- and S-domains in protein Ml or the Z~, , 
5 B- and C-domains in protein A (see Figure 6) and 

variants, subf ragments , multiples or mixtures of these 
domains that have the same binding properties which bind 
to heavy chains of immunoglobulins. 

By subfragment is meant a part-fragment of the given 
domains or fragments which include parts from the vari- 
ous domains having mutually the same binding properties. 
By variants is meant proteins or peptides in which the 
original amino acid sequence has been modified or 
changed by insertion, addition, substitution, inversion 
or exclusion of one or more amino acids, although while 
retaining or improving the binding properties. The 
invention also relates to those proteins which contain 
several arrays (multiples) of the binding domains or 
mixtures of the binding domains with retained binding 
properties- The invention also relates to mixtures of 
the various domains of amino acid sequences having 
mutually the same binding propp-rties. 

25 The invention relates in particular to a hybrid protein 

designated LG, and is characterized in that the hybrid 
protein includes the B-domains in protein L which bind 
to the light chains in immunoglobulins, and the Cl- 
domains and C2-domains in protein G which bind to heavy 

3 0 chains and have the amino acid sequence disclosed in 

Figure 3. The invention also relates to variants, sub- 
fragments, multiples or mixtures of these domains. 



15 



Protein LG is a hybrid protein having a molecular weight 
35 of about 50 kDa (432 amino acids) and comprising four 

domains, each of which binds to light chains in immuno- 
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globulins, and two IgG-binding domains from protein G. gSJ*. 

The hybrid protein combines a broad IgG-bxndxng activi- w>y, 
ty, deriving from the high--grade binding ability of 

protein G to the Fc-fragment of the heavy chain on IgG imami 

5 with the ability of the protein L to bind to light 

chains of all classes of immunoglobulins. Thus, protein 0 

LG binds polyclonal human IgG, IgM, IgA, IgD and IgE. SjK^" 

10 -1 2SiC<- 

The affinity for human polyclonal IgG is 2 x 10 M . mtm^ 

All four human immunoglobulin classes are bound. Binding 

10 to human IgG is effected with both the jc-and the X- |l?v*** 



35 



chain. Both the Fc-fragment and the Fab-fragment of IgG 



with elektroblotting. The protean can be immobilized on 



are bound to the hybrid protein. The protein also binds . . 
human IgA-, IgD-, IgE- and IgM-antibodies* The bond is 

stronger to human immunoglobulins which carry x than to ^"fe 

15 those which carry the X-isotope of light chains. IgG ^v^' 

from most mammals will be bound by protein LG, thus also 

IgG from goats and cows, which do not bind to protein L. ^yj^l 

However, rabbit-IgG which binds relatively wecikly to ^rf^' 



protein L will bind well to the fusion protein. IgM and 
20 IgA-antibodies from mice, rats and rabbits will be bound ^* "^ ' ^ 

to the protein- 

Protein LG is highly soluble. It is able to withstand 
heat and will retain its binding properties even at high 
25 temperatures. The binding properties also remain in a 

broad pH-range of 3-10. The protein withstands aetergent 

and binds marked or labelled proteins subsequent to Jv^aI;! 

separation in SDS-PAGE and transference to membranes >VgW 



3 0 a solid phase (nitrocellulose, Immobilon®, polyacryl- 



amide, plastic, metal and paper) without losing its 
binding capacity. The binding properties are not influ- 

enced by marking with radioactive substances, biotin or 5^5^ 

alkaline phosphatase. (The binding abilities of the ^^%5;^v 

protein LG are disclosed in Example 3) . •^'i^>. 
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The protein comprises 432 amino acids and has a molecu- 
lar weight of 50 kDa deriving therefrom. The sequence is 
constructed of an ala sequence of the three last amino 
acids in the A-domain of the protein L (val-glu-asn) , 
5 this ala sequence being unrelated to the two proteins, 
whereafter the four mutually high-grade homologous B- 
domains from protein L follow. The first of the B-do- 
mains is comprised of 76 amino acids, and the remaining 
domains are each comprised of 72 amino acids. The first 

10 nine amino acids from the fifth B-domain are included 
and followed by two non-related amino acids (pro-met) . 
The protein G-sequences then follow. The last amino acid 
in the so-called S-domain from protein G is followed by 
an IgG-binding domain from protein G (CI; 55 amino 

15 acids) , the intermediate D-region (15 amino acids) and 
the second IgG-binding C-domain (C2; 55 amino 
acids) . The last amino acid is a methionine, which 
occurs in natural protein G as the first amino acid in 
the so-called W-region. 

20 

The invention also relates to DNA-sequences which code 
for the aforesaid proteins. 

The gene which codes for the IgG-binding amino acid 
25 sequences can be isolated from the chromosomal DNA from 

Staphylococcus aureus based on the information on the 
DNA-sequence for protein A (S. Lofdahl, B. Guss, M. 
Uhlen, L. Philipsson and M. Lindberg, 1983- Gene for 
staphylococcal protein A. Proc. Natl, Acad. Sci. USA. 
30 80: 697-701) and Figure 6, or from G-streptococcus , 

preferably strain G 148 or C-streptococcus , preferably 
strain Streptococcus equisimilis C 40, based on the 
information on protein G (B. Guss, M. Eliasson, 
A. Olsson, M. Uhlen, A.-K. Frej, H. Jorvall, I. Flock 
3 5 and M. Lindberg. 1986. Structure of the IgG-binding 
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regions c streptococcal protein G, EMBO» J. 5: 1567- 
1575) and Figure 6, or from group A-streptococcus , 
e.g, S . pyogenes (type Ml) based on the information on 
the DHA-sequence for protein H (H. Gomi, T. Hozumi, 
S. Hattori, C. Tagawa, F. Kishimoto and L. Bj6rc)c, 1990. 
rae gene sequence and some properties of protein H - a 
novel IgG binding protein J- Immunol. 144: 4046-4052) 
and Figure 6, or from the chromosomal DNA in group A- 
streptococcus type Ml based on the information on the 
DHA-sequence for protein M (Applicant's Patent Applica- 
tion, PCT/SE 91100447) and Figures 6 and 7. The gene 
which codes for the protein that binds to light chains 
can be isolated from the chromosomal DNA from Pepto-co- 
ccus maanus 312 based on the information on the DNA- 
seguence for protein L in Claim 2. 

By using the chromosomal DNA't obtained from the afore- 
said bacteria as a template, a DNA-fragment defined with 
the aid of two synthetic oligonucleotides can then be 
specifically amplified with the aid of PGR (Polymerase 
Chain Reaction) . This method also enables recognition 
sites to be incoirporated for restriction enzymes in the 
ends of the amplified fragments (PGR technology, Ed: PGR 
Technology. Principles and Applications for DNA Amplifi- 
cation. Ed. Henry Erlich. Stockton Press, New York, 
198 9) . The choice of recognition sequences can be adapt- 
ed in accordance with the vector chosen to express the 
fragment or the DNA-fragment or other DNA-f ragments with 
which the amplified fragment is- intended to be combined. 
The amplified fragment is then cleaved with the restric- 
tion enzyme or enzymes concerned and is combined with 
the fragment/ the other fragments concerned and the 
fragments are then cloned together in the chosen vector 
(in this case, the expression vector) (Sambrook, J.E. 
Fritsch and T, Maniatis, 1989, Molecular cloning: A 
laboratory manual, 2nd Ed. Gold Spring Harbor Laborato- 
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ries. Cold Spring Harbor, New York, USA), The piasmid 
vector pHD3l3 can be used (Dalboge, H.E* Bech Jensen, 
H, Tottrup, A. Grubb, M» Abrahamscn, I. Olafsson and S. 
Carlsen, 1989. High-level expression of active human 
cystatin C in Escherichia coli. Gene, 79: 325-332), 
alternatively one of the vectors in the so-called PET- 
series (PET 20, 21, 22, 23) retailed by Novagen (Madi- 
son, Wisconsin, USA). 



10 
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The hybrid proteins are then incorporated in an appro- 
priate host, preferably E. coli. The invention also 
relates to such hosts as those in which the hybrid 
proteins are incorporated. 

Those clones which produce the desired proteins can be 
selected from the resultant transf ormants with the aid 
of a )cnown method (Fahnestock et al., J. Bacteriol. 167, 
870 (1986). 

When the proteins that can bind to the light chains in 
the immunoglobulins and to the heavy chains in IgG have 
been purified from the resultant positive clones with 
the aid of conventional methods, the binding specifici- 
ties of the proteins are determined for selection of 
those clones which produce a protein that will bind to 
the light chains in immunoglobulins and to the heavy 
chains in IgG. 

Subsequent to having isolated piasmid DNA't in said 
clone with conventional methods, the DNA-sequence in the 
inserted material is determined with known methods 
(Sanger et al. , Proc. Natl. Acad. Sci. USA 74, 5463 
(1977) , 
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The invention also relates to DNA-sequences which hy- 
bridize with said identified DNA-sequences under conven- 



1 
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tional conditions and which code for a protein that 
possesses ,ae desired binding properties. Strict hybrid- 
izing conditions are preferred. 

5 Expression of the genes can be effected with expression 

vectors which have the requisite expression control 
regions, the structural gene being introduced after said 
regions. As illustrated in Figure 1 and Claim 2, the 
structural gene can be used for protein LG or other 
10 hybrid proteins with protein L. 

with regard to expression vectors, different host-vec- 
tor-systems have been developed, of which the most 
suitable host-vector-systems can be selected for expres- 
15 sion of the genes according to the present invention. 

The present invention also relates to a method of pro- 
ducing the inventive hybrid proteins by cultivating a 
host cell which is transformed with an expression vector 
2 0 in which DNA't which codes for the proteins according to 

the invention is inserted. 

This method includes the steps of 

25 (1) inserting into a vector a DNA-fragment which codes 

for the hybrid proteins; 

(2) transforming the resultant vector into an appropri- 
ate host cell; 

30 

(3) cultivating the resultant, transformed cell for 
preparation of the desired hybrid protein; and 

(4) extracting the protein from the culture. 
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In the first step, the DNA-fragment which codes for the 
hybrid protein is inserted in a vector which is suitable 
for the host that is to be used to express the hybrid 
protein. The gene can be inserted by cleaving the vector 
with an appropriate restriction enzyme, and then legat- 
ing the gene with the vector. 

In the second step, the vector with the hybrid plasmid 
is inserted into host cells* The host cells may be 
Escherichia, coli. Bacillus subtilis or Saccharomvces 
cerevisiae or other suitable cells. Transformation of 
the expressions hybrid vector into the host cell can be 
effected in a conventional manner and clones which have 
been transformed can then be selected. 

In the third step, the obtained transf ormants are culti- 
vated in an appropriate medium for preparation of the 
desired proteins by expression of the gene coded for the 
hybrid protein » 

In the fourth step, the desired protein is extracted 
from the culture and then purified. This can be achieved 
with the aid of known methods. For instance, the cells 
can be lysed with the aid of known methods, by treating 
the cells with ultrasonic sound, enzymes or by mechani- 
cal degradation. The protein which is released from the 
cells or which excretes in the medium can be recovered 
and purified with the aid of conventional methods often 
applied within the biochemical fdeld, such as ion-ex- 
change chromatography, gel filtration, affinity chroma- 
tography with the use of immunoglobulins as ligands, 
hydrophobic chromatography or reverse-phase chromato- 
graphy. These methods can be applied individually or in 
suitable combinations. 



35 
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As before mentioned, the inventive proteins may be used 
for binding, identifying or purifying inununoglobulins. 
They can also be ix>und to pharmaceuticals and used in 
formulations which have delayed release properties. To 
this end, the protein may be present in a reagent appli- 
ance for pharmaceutical composition in combination with 
appropriate reagents, additives or carriers. 

The proteins can be handled in a freeze-dried state or 
in a PBS-solution (phosphate-buffered physiological salt 
solution) pH 7.2 with 0.02% NaN^. It can also be used 
connected to a solid phase, such as carbohydrate-based 
phases, for instance CNBr-activated sepharose, agarose, 
plastic surfaces, polyacrylamide, nylon, paper, magnetic 
spheres, filter, films. The proteins may be marked with 
bictin, alkaline phosphatase, radioactive isotopes, 
fluorescein and other fluorescent substances, gold 
particles, ferritin, and substances which enable lumi- 
nescence to be measured. 

Other proteins may also be used as carriers. These 
carriers may be bound to or incorporated in the pro- 
teins, in accordance with the invention. For instance, 
it is conceivable to consider the whole of proteins A, 
G, H, M as carriers for inserted sequences of protein L 
which bind to light chains. In turn, these carriers can 
be bound to the aforesaid carriers. 

The pharmaceutical additions that can be used are those 
which are normally used within this field, such as 
pharmaceutical qualities of mannitol, lactose, starch, 
magnesium stearate, sodium saccharate, talcum, cellu- 
lose, glycose, gelatine, saccharose, magnesium carbonate 
and similar extenders, such as lactose, dicalcium phos- 
phate and the like; bursting substances, such as starch 
or derivatives thereof; lubricants such as magnesium 
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stearate and the like; binders, such as starchy gum 
aribicum, polyvinylpyrrolidone, gelatine, cellulose and 
derivatives thereof, and the like. 

The invention will now be described in more detail with 
reference to the accompany drawings, in which 

Figure 1 illustrates the plasmid pHD389; the ribosomal 
binding sequence, the sequence for the signal peptide 
from ompA and recognition sequence for several restric- 
tion enzymes are shown; 
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Figure 2 illustrates the amino acid and nucleic acid 
sequence for protein LG; 

Figure 3 is a schematic overall view of the production 
of protein L; 

Figure 4 is a schematic overall view of the production 
of protein LG; 

Figures 5a, 5b and 5c are schematic overall views of the 
production of the hybrid proteins LA, LM and LH respec- 
tively; 

Figure 6 is a schematic inclusive illustration of pro- 
tein A, G, H and Ml, IgGFc-binding domains are for 
protein A: E, D, A, B and C; for protein G: Cl, C2 and 
C3; for protein H: A and/or B; ajnd for protein Ml: A, 
Bl, 82, B3 and S; 



w. 
m 



Figure 7 illustrates the amino acid and nucleic acid se- 
quence for protein Ml; 
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Figure 8 illustrates Western Blot for protein G, L and 
LG with certain inonunoglobulins and irununoglubulin 
fragments; and 

5 Figure 9 illustrates Slot-Blot for protein L, G and LG 
with IgG, Igx and Ig Fc. 

The amino acid and nucleic acid sequence of the light- 
chain binding domains of protein L is illustrated in 
10 claims X and 2 respectively. 

It will be observed that the drawings are not to scale. 

Example I 

15 

Cloning and expression of the IgG-light-chain-binding 
doniaj.ns in FrQtein I, 

Construction of synthetic oligonucleotides (primers) for 
2 0 amplifying sequences coded for protein domain B1-B4 

It has been found that a protein h peptide (expressed in 

£. coli ^ constructed of the sequence ala-val-glu-asn- 

domain Bl (from protein L) binds to the light chains of 

25 the immunoglobulins (W. Kastern, U. Sjobring and L. 

Bjorck, 1992. Structure of peptostreptococcal protein L 

and identification of a repeated immunoglobulin light 

chain-binding domain. J. Biol. Chem. in-print) . since 

this simple protein L-domain has a relatively low affin- 
7 -1 

30 ity to Ig, (1 X 10 M ), and since the naturally occur- 
ring protein L which is constructed of several mutually 

similar domains (B1-B5) has a high affinity to Ig (1 x 
10 -1 

10 M ) four of these domains have been expressed 
together in the following way: 

35 
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PL-N and PL-Cl are synthetic oligonucleotides (manufac- V 
tured by the Biomolecular Unit at Lund University (Swe- 'f. 
den) in accordamce with Applicant's instructions) which 
have been used to amplify a clonable gene fragment which ^ 
5 is amplified with PGR (Polymerase Chain Reaction) and K 
which codes for four Ig-binding protein L domains (ala- 
val-glu-asn-Bl-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-glu- 
glu) . Amino acids in the protein L-sequence are given 
for the primer which corresponds to the coded strand 
10 (PL-N) : 

* \ 

PL-N : 5 ' -GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC-3 ' 

valgluasnlysglugluthrproglu 

15 

5 '-end of this oligonucleotide is homologous with the *^ 
coded strand in the protein L-gene (emphasized) : those 
codons which code for the last three amino acids in the 
A-domain (val-glu-asn) are followed by the codons for 
2 0 the first six amino acids in the first of the Ig-binding 
domains in protein L (Bl) . 

PL-Cl: 5 ' -CAGCAGCA^ATTCIIAITATTCTTCTGGTTTTTCGTCAAC^^ 
CTT-3 ' 

25 

This oligonucleotide is homologous with the opposing 
non-coding strand in the gene for protein L (the se- 

quence corresponds to the first nine amino acids in A^"* 
domain B5) . * 7^ 

30 a:. 

DNA- fragments which have been amplified with the aid of V 

PL-N contain the recognition sequence for the restric- ^ 

tion enzyme HpalX (emphasized) immediately before the ^ 

codon which is considered to code for the first amino !<I 
35 acid (val) in the expressed protein L-fragment. The 

fragment which is cleaved with Hpall can be ligated with ^! 
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DNA (in this case, consisting of the used expression 
vector pHD389) which has been cleaved with the restric- 
tion enzyme Marl. The DNA-f ragment that has been cleaved 
with Hpall and ligated with vector pHD389, which has 
5 been cleaved with MaxI, will be translated in the cor- 
rect reading frame. The construction results in trans- 
lation of an additional amino acid (ala) immediately in 
front of the first amino acid in protein L. 

10 DNA-fragments which have been amplified with the aid of 
PL-Cl will contain the recognition sequence for the 
restriction enzyme BaaKI (over lined above the sequence) 
immediately after the sequence which codes for the last 
amino acid in the expressed protein L-f ragment (glu) . 

15 The vector pHD389 contains a unique recognition sequence 

for BaaHI as part of its so-called multiple cloning 
sequence which follows the Narl recognition sequence. 
DNA-fragments which have been amplified with the aid of 
PL-Cl will include two so-called stop-codons (empha- 

20 sized) which results in translation of the fragment 

inserted in the vector to cease. 

The sequence which was considered to be amplified con- 
tains no internal recognition secpuences for the restric- 
25 tion enzymes HpaXX or BamJHI. 

Amplifying and cloning procedures 

(PCR) (Polymerase Chain Reaction) was effected with a 
30 protocol described by Saiki, R.D. Gelfand, S. Stoffel, 

S. Schzirf, R. Higuchi, G. Horn, K. Mullis and H. Erlich, 
1988; Primer-directed enzymatic amplification of DNA 
with a thermostable DNA polymerase. Science 239: 487- 
49127; PCR was effected in a Hybaid Intelligent Heating- 
35 block (Teddington, UK): 100 ^1 of a reaction mixture 

contained 50 mM KCl, 10 mM Tris-HCl, pH 8.3, 1.5 mM 
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of DNA (Sambrook, J-E. Fritsch and T. Maniatis, 1989. 
Molecular cloning: A laboratory manual. 2nd Ed. Cold 
Spring Harbor Laboratories, Cold Spring Harbor, New 
York, USA) . The cleaving and ligating conditions recom- 
5 mended by the manufacturer of DNA-ligase and restriction 
enzymes have been followed in other respects. 

Expression system 

10 The vector pHD389 (see Figure 2) is a modified variant 
of the plasmid pHD313 (Dalboge, H.E. Bech Jensen, H. 
Tottrup, A. Grubb, M. Abrahamson, I. Olafsson and 
S. Carlsen, 1989. High-level expression of active human 
cystatin C in Escherichia coli. Gene, 79: 325-332). The 

15 vector, which is replicated in £. coli (contains ori 

origin of replication from plasmid pUC19) is constructed 
so that DNA-fragments which have been cloned into the 
cleaving site of Karl will be transcribed and translated 
downstream of and in the immediate vicinity of the 

20 signal peptide (21 amino acids), from envelope -protein 

ompA from £. coli . Translation vill be initiated from 
the codon ATG which codes for the first amino acid 
(methionine) in the signal peptide. This construction 
permits the translated peptide to be transported to the 

25 periplasmic space in E. coli . This is advantageous, 

since it reduces the risk of degradation of the desired 
product of enzymes occurring intracellular ly in E. coli. 
Moreover, it is easier to purify peptides which have 
been exported to the periplasic: space. Unique recog- 

30 nition sequences (multiple cloning sequences) for sever- 
al other restriction enzymes, among them ecoRI, Sail and 
BamHI are found immediately after the KaxX cleaving 
site. An optimized so-called Shine-Dalgarno-seguence 
(also called ribosomal binding site, RBS) is found seven 

35 nucleotides upstream from the ATG-codon in the signal 

sequence from ompA, this optimized sequence binding to a 
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complementary sec[uence in 16S rRNA in the ribosomes and 
is responsible for the translation being initiated in 
the correct place. The transcription of such DNA as that 
which is CO- transcribed with the signal sequence for 
5 oBpA is controlled by the P -promoter from coliphage X. 

R 

The vector also contained the gene for cI857 from coli- 
phage X whose product down-regulates transcription from 
P (and whose product is expressed constitutive ly) . This 
cI857-mediated down-regulation of transcription from P^^ 

10 is heat-sensitive. The transcription regulated from this 
promoter is terminated with the aid of a so-called rho- 
independent transcription terminating sequence (forms a 
structure in DNA't which results in the DNA-dependent 
RNA-polymerase leaving the DNA-strand) which is placed 

15 in the vector immediately downstream of the multiple 

cloning sequence. The plasmid also carries the 6-lacta- 
mase gene (from the plasmid pUCl9) whose product permits 
ampicillin-selection of £* coli clones that have been 
transformed by the vector. 



20 



Selection of protein L-producing clones 



The transformed bacteria are cultivated, or cultured, on 
cultxire plates with an LB-medium which also contained L 
25 ampicillin in a concentration of 100 fxg/ml. Cultivation ^ 

of the bacteria progressed overnight at 30*C, whereafter 
the bacteria were transferred to an incubator where they v> 
were cultivated for a further 4 hours at 42**C. The 

plates were kept in a refrigerator overnight. On the ^ 
3 0 next day, the colonies were transferred to nitrocellu- 

lose filters. Filters and culture plates were marked so •/ 
as to enable the transferred colonies to be readily ^^ 
identified on respective culture plates. The culture 
plates were again incubated overnight at lO^'C, so that 
35 remaining rests of transferred bacteria colonies could -^^ 
again grow. The plates were then kept in a refrigerator. 

y. 
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The bacteria in the colonies on the nitrocellulose- 

impressions were lysed by incubating the filter in 10% 

SDS for 10 minutes- Filters containing lysed bacteria 

were then rinsed with a blocking buffer which comprised 

5 PBS (pH 7.2) with 0.25% gelatine and 0.25% Tween-20 

(four baths, 250 ml each at 37»C), whereafter the filter 

was incubated with radioactively marked (marked with 
125 

I in accordance with the chloramin-T-method) Ig-x- 
chains (20 ng/ml in PBS with 0.1% gelatine). The incuba- 

10 tion took place at room temperature over a period of 3 

hours, whereafter non-bound radioactively marked protein 
was rinsed-off with PBS (pH 7.2) containing 
0.5 M NaCl, 0.25% gelatine and 0.25% Tween-20 (four 
baths, 250 ml each at room temperature) . All filters 

15 were exposed to X-ray film. Positive colonies were 

identified on the original culture plate. Clones which 
reacted with Ig-ic-chains were selected and analyzed with 
respect to the size on the DNA-fragment introduced in 
the vector. One of these clones was selected for the 

20 production of protein L, pHDL. The DNA't introduced from 

this clone into plasmid pHD389 was sequenced. The DNA- 
sequence was found to be in full agreement with corre- 
sponding sequences (B1-B4 and 21 bases in B5) m the 
gene for protein L from Peptostreptococcus macrnus , 

25 strain 312. The size and binding properties of the 

protein produced by clone pHDL was analyzed with the aid 
of SDS-PAGE (see Figure 8) , dot-blot experiment (see 
Figure 9) and competitive binding experiments. 




'MM- 



3 0 Production of protein L 

Several colonies from a culture plate with E. coll pHDL 
were used to inoculate a preculture (LB-medium with an 
addition of 100 mg/1 ampicillin) , which was cultured at 
35 28**C overnight. On the following morning, the preculture 
was transferred to a larger volume (100 times the volume 
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of the preculture) of fresh LB -medium containing ampi- 
cillin (100 mg/1) and was cultured in shake-flasks (200 
rpm) , (or fermentors) at 28 *C, The culture temperature 
was raised to 40*»c (induction of transcription) when the 
5 absorbency value at 620 nm reached 0.5. Cultivation then 

continued for 4 hours (applied solely to cultivation in 
shake-flasks) . Upon completion of the cultivation pro- 
cess, the bacteria were centrifuged down. The bacteria 
were then lysed with an osmotic shock method at 4*C 

10 (Dalbdge et al., 1989 supra). The lysate was adjusted to 

a pH = 7. Remaining bacteria rests were then centrifuged 
down, whereafter the supernatent was purified on IgG- 
sepharose in accordance with earlier described protocol 
for protein G and protein L (U. Sjobring, L. Bjorck and 

15 W. Kastern. 1991.. Streptococcal protein G: Gene struc- 
ture and protein binding properties. J. Biol. Chem. 266: 
399-405; W. Kastern, U. Sjobring and L. Bjorck. 1992. 
Structure of peptostreptococcal protein L and identifi- 
cation of a repeated immunoglobulin light chain-binding 

20 doman. J, Biol, Chem. in-print. 

The expression system gave about 20 mg/1 of protein L 
when cultivation in shake-flasks. The culture was depos- 
ited at DSSM, Identification Reference DSSM E. coll 
25 LE392/PHDL. 
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Example 2 

Cloning and expression of protein LG 

5 Construction ot oligonuclttotides (primers) for amplify- 
ing seguancss vhich code for protein LG 

Protein L 

10 It has been found that a protein L-peptide (expressed in 

£. coli ) constructed of the sequence ala-val-glu-asn- 
domain Bl (from protein L) will bind to the light chains 
of the immunoglobulins (Kastern, Sjobring and Bjorck, 
1992, J« Biol. Chem. in-print) . Since the affinity of 

15 this simple domain to Ig is relatively low (1 x 10 M 

^) and since the naturally occurring protein L, which is 

comprised of several mutually similar domains (B1-B5) 

10 -1 

has a higher affinity to Ig (1 x 10 M ), four of 
these domains have been expressed together in the fol- 
2 0 lowing way: 

PL-N and PL-C2 are synthetic oligonucleotides (manufac- 
tured at the Biomolecular Unit at Lund University (Swe- 
den) in accordance with Applicant's instructions) which 
25 were used, with the aid of PGR {Polymerase Chain Reac- 
tion) to amplify a clonable gene fragment, called Bl-4, 
which codes for four Ig-binding protein L domains (ala- 
val-glu-asn"Bl-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-glu- 
glu) : 

30 

PL-N : 5 ' -GCTCAGGCGGCG CCGG TAGAAAATAAAGAAGAAACACCAGAAAC- 3 ' 

valgluasnlysglugluthrproglu 

P1-C2 : 5 ' -CAGCAGCAGCC ATGGG TTCTTCTGGTTTTTCGTCAACTTTCTTA- 
35 3' 
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Amino acids have been shown under corresponding triplets 
in the coded strand. DNA- fragments which have been 
amplified with the aid of PL-N contain the recognition 
sequence for the restriction enzyme Hpall immediately 
5 upstream of the triplet which codes for the first amino 
acid (val) in the expressed protein L-fragment. The 
fragment that has been cleaved with Hpall can be ligated 
with DNA (in this case, the used expression vector 
pHD389) which has been cleaved with Harl. The construe- 

10 tion results in translation of an extra amino acid (ala) 
immediately upstream of the first amino acid in the 
protein L-fragment. The DNA-fragment that has been 
amplified with the aid of PL-C2 will contain the recog- 
nition sequence for the restriction enzyme HcoX (empha- 

15 sized) immediately downstream of the sequence which 

codes for the last amino acid in the expressed protein 
L-fragment (glu) . Amplified fragments which have been 
cleaved with Heel can be ligated to the Ncol-cleaved, 
PCR-generated protein-asp-CDC-met-f ragment (see below) . 

20 

Protein G \ 

It is known that a simple C-domain from protein G will 
25 bind to IgG (B. Guss, M. Eliasson, A* Olsson, M. Uhlen, 

A.-K* Frej, H. Jornvall, I. Flock and Lindberg. 1986. 
Structure of the IgG-binding regions of streptococcal 
protein G. EMBO. J, 5: 1567-1575). The strength at which 
a simple C-domain binds to IgG Is relatively low 
30 (5 X 10 M ) . A fragment which consists of two C-do- 

mains with an intermediate D-region having a length of 

15 amino acids, however, has a considerably higher 

, . 9 -1 

affinity to IgG (1 x 10 M ). CDC-N and CDC-C are 

oligonucleotides which have been used as PCR-primers to 

3 5 amplify a clonable DNA-f ragment, designated CDC, which 
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codes for two IgG-binding protein G-domains (pro-met- 
asp-CDC-met) . 

CDC-N : GG?K!KT5SACACTTACAAArrAATCClTAATGGT 
^ metaspthrtyrlysleuileleuasngly 

CDC-C : CAGGTCGA CTTATTA CATTTCAGTTACCGTAAAGGTCTTAGT 

Amino acids in the resultant sequence have been shown 
10 beneath the primer of the coding strand. DNA-f ragments 
which have been 2anplified with the aid of CDC-N contain 
the recognition sequence for the restriction enzyme Ncol 
(marked with a line above the sequence) . Cleaved ampli- 
fied fragments can be ligated with the fragment that has 
15 been amplified with the aid of PL-C2 and then cleaved 

with Ncol. The fragment will therewith be translated to 
the correct reading frame. DNA-f ragments which have been 
amplified with the aid of CDC-C will contain two so- 
called stop Condons (emphasized) which terminate trans- 

2 0 lation. The recognition sequence for the restriction 

enzyme Sail (marked with a line above the sequence) 
follows immediately afterwards, this sequence also being 
found in the expression vector pHD389 (see Figure 1) . 

25 Those sequences which code for the binding properties of 

protein L (B1-B5) and for protein G (CDC) respectively 
contain no internal recognition sequences for the re- 
striction enzymes Hpall, Sail or Kcol. 

3 0 Amplif if ication and cloning procedures 

PCR (Polymerase Chain Reaction) was carried out in 
accordance with a protocol described by Saiki et al., 
1988; PCR was carried out in a Hybaid Intelligent Heat- 
3 5 ing-block (Teddington, UK) : 100 m1 of the reaction 

mixture contained 50 mM KCl, 10 mM Tris-HCl, pH 8.3, 1.5 
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mM MgCl^, 100 fxg/ml gelatine, 300 with respect to 
each of the deoxynucleotides (dATP, dCTP, dGTP, dTTP) , 
(Pharmacia) . In order to amplify sequences which code 
for the light-chain binding parts of protein L, there 
5 were added 20 pmol of each of the oligonucleotides PL-N 

and PL-C2, and 10 m1 of a DNA-solution which contained 
0,1 mg/ml of chroxnosomal DNA from Peptostreptococcus 
maonus, strain 312, By way of an alternative, 20 pmol 
were added to each of the oligonucleotide pairs CDC-N 
10 and CDC-C and 10 ^1 of a DNA-solution which contained 

0.1 mg/ml of chromosomal DNA from a group C streptococ- 
cus strain ( Streptococcus ecfuisimilis ) called C40 
(U. Sjobring, L, Bjorck and W. Kastern. 1991. Strepto- 
coccal protein G: Gene structure and protein binding 
15 properties. J. Biol. Chem. 266: 399-405 or with Ncol and 
Sail (10 V/fig PCR-product) , (for CDC) at 37«C. The thus 
amplified and subseqently cleaved DNA-f ragments were 
then separated by electrophoresis in a 2% (weight by 
volume) agrose gel (NuSieve agarose, FMC Bioproducts) in 
a TAE-buffer (40 mM Tris, 20 mM aNa-cetate, 2 mM EDTA, 
pH 8.0), The resultant fragments, 930 bp (for Bl-4) 
and 390 bp (for CDC) were cut from the gel. The 
concentration of DNA in the thus separated gel pieces 
was estimated to be 0.05 mg/ml. The agarose pieces cut 
from the gel and containing the cleaved, amplified 
fragments (Bl-4 and CDC) were melted in a water bath at 
65 "C, whereafter they were allowed to cool to 37*»C. 
10 Ml (0.5 pg) of this DNA were transferred to a semi- 
microtube (Sarstedt) , preheated to 37«C, whereafter 1 m1 
30 of the vector pHD389 which had been cleaved with Narl 
and Sail were added. 1 ^1 10 x ligase buffer (Promega) 
and 1 ;il T4 DNA-ligase (1 unit/^l) were also added. The 
ligating reaction was permitted to take place at 37 *c 
for 6 hours. The cleaving and ligating conditions recoro- 
35 mended by the producer of DNA-ligase and restriction 

enzymes (Promega) were followed in other respects. The 
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ligating reaction was then used to transform E. coli . 
strain LE392, which had been made competent in accor- 
dance with the rubidium-chloride/calcium-dichloride 
method as described by Kushner (1978) . Manipulation of 
5 DNA was effected in accordance with molecular biological 
standard methods (Sainbrook et al., 1989). 

Bzpression systftm 

10 The vector pHD389 (see Figure 2) is a modified variant 

of the plasmid pHD313 (Dalboge et al., 1989). The vector 
which was replicated in E> coli (contains origin of 
replication from plasmid pUC19) is constructed such that 
DNA-fragments which have been cloned in the cleaving 

15 site for Karl will be expressed immediately after, or 

downstream, of the signal peptide (21 amino acids) from 
the envelope protein oapA from E. coli . Translation will 
be initiated from the ATG-codon which codes for the 
first amino acid (methionine) in the signal peptide. The 

2 0 construction with an E. coli -individual signal sequence 

which precedes the desired peptide enables the translat- 
ed peptide to be transported to the periplasmic space in 
£. coli . This is beneficial since it reduces the risk of 
degradation of the desired product through the intracel- 

25 lular occurrent enzymes of £. coli . Furthermore, it is 

easier to purify peptides which have been exported to 
the periplasmatic space. Unique recognition sequences 
(multiple cloning sequences) for several other restric- 
tion enzymes, among them BcoRI,- Sail and BamHI are 

30 present immediately downstream of the Karl cleaving 
site. An optimized so-called Shine-Dalgarno sequence 
(also called ribosomal binding site, RBS) is found seven 
nucleotides upstream of the ATG-codon in the signal 
sequence from ompA, this optimized Shbine-Dalgarno 

35 sequence binding to a complementary sequence in 16S rRNA 

in the ribosomes and in a manner to decide that the 
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translation is initiated in the correct place. The 
transcription of such DNA as that which is co-trans- 
cribed with the signal sequence for ospA is controlled 
by the P^-promotor from coliphage X. The vector also 
contains the gene for cI857 from coliphage X, the prod- 
uct of which regulates-down transcription from and 
the product of which is expressed constitutively . This 
cI857-mediated down-regulation of transcription from p^ 
is heat-sensitive* Transcription which is regulated, or 
controlled, from this promoter will be terminated with 
the aid of a so-called rho-independent transcription 
terminating sequence which is inserted in the vector 
immediately downstream of the multiple cloning site. The 
plasmid also carries the gene for 6-lactamase (from the 
plasmid pUCl9) , the product of which permits ampicillin- 
selection of E. coli clones that have been transformed 
with the vector. 

Selection of protein LG-produced clones 

The transformed bacteria are cultivated on culture 
plates with LB-medium which also contained ampicillin in 
a concentration of lOO Mg/ml. The bacteria were culti- 
vated overnight at 3 0*»C, whereafter they were trans- 
ferred to a cultivation cabinet (42 "C) and cultured for 
a further four (4) hours. The plates were stored in a 
refrigerator overnight. On the following day, the colo- 
nies were transferred to nitrocellulose filters. The 
filters and culture plates were , marked, so that the 
transferred colonies could later be identified on the 
culture plate. The culture plates were again incubated 
overnight at 30 °c, so that rests of transferred bacteria 
colonies remaining on the plates could again grow. The 
plates were then stored in a refrigerator. The filter 
was incubated in 10% SDS for 10 minutes, so as to lyse 
the bacteria in the colonies on the nitrocellulose 
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impression* Filters containing lysed bacteria were then 

rinsed with a blocking buffer consisting of PBS (pH 7.2) 

with 0.25% gelatine and 0.25% Tween-20 (four baths of 

250 ml at 37«C) , whereafter the filter was incubated 

125 

5 with radioactively (marked with I according to the 
chloromine-T-method) marked Ig-<-chains (20 ng/ml) in 
PBS with 0*1% gelatine). The incubation process took 
place at room temperature for four (4) hours, whereafter 
non-bound radioactively marked protein was rinsed-off 

10 with PBS (pH 7.2) containing 0.5 M NaCl, 0.25% gelatine 

and 0.25% Tween-20 (four baths, 250 ml each at room tem- 
perature). All filters were exposed to X-ray film. 
Positive colonies on the original culture plate were 
identified. A number of positive colonies were re- 

15 cultivated on new plates and new colony-blot experiments 
were carried out with these plates as a starting materi- 
al with the intention of identifying K. coli colonies 
which bind IgG Fc. These tests were carried out in 
precisely the same manner as that described above with 

20 respect to the identification of £. coli -colonies which 

expressed Ig light-chain-binding protein, with the 

125 

exception that a radioactively roarked\( I) IgG Fc (20 
ng/ml) was used as a probe. Clones which reacted with 
both proteins were selected and analyzed with regard to 

25 the size of the DNA-fragment introduced in the vector. 

One of these clones was chosen for production of protein 
LG, pHDLG. The DNA't taken from this clone and intro- 
duced into plasmid pHD389 was sequenced. The DNA-se- 
quence exhibited full agreement, with corresponding 

3 0 sequences (B1-B4 and 21 bases in B5) in the gene for 

protein L from Peptostreptococcus maanus , strain 312, 
and with C1DC2 sequence in group C streptococcus strain 
C4 0. The size and binding properties of the protein 
produced from clone pHDLG was analyzed with the aid of 

3 5 SDS-PAGE (see Figure 8), dot-blot experiment (see Figure 

10) and competitive binding experiments. 
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Production of protein I*G 

Several colonies from a culture plate with £• coli pHDLG 
were used to inoculate a preculttire (LB-raedium with an 
5 addition of 100 mg/1 ampicillin) were cultivated at 28*»C 
overnight. In the morning, the preculture was trans- 
ferred to a larger volume (100 times the volume of the 
preculture) of fresh LB-medium containing ampicillin 
(100 mg/1) and was cultivated in vibrating flasks (200 

10 rpm) , (or fermenters) at 28*C. When an absorbence value 

of 0.5 was reached at 620 nm, the cultivation tempera- 
ture was raised to AO^C (induction of transcription). 
The cultivation process was then continued for 4 hours 
(applies only to cultivation in vibrated flasks) . The 

15 bacteria were centrifuged down upon termination of the 

cultivation process. The bacteria were then lysed at 
in accordance with an osmotic shock method (Dalboge et 
al., 1989). The lysate was adjusted to a pH of 7. Re- 
maining bacteria rests were centrifuged down and the 

2 0 supernatent then purified on IgG-sepharose, in accor- 

dance with the protocol earlier described with reference 
to protein G and protein L. (Sjobring et al., 1991, 
Kastern et al. , 1992). 

25 The expression system gave about 30 mg/1 of protein LG 

when cultivation in vibrated flasks. A deposition has 
been made at DSSM, Identification Reference DSSM coli 
LE3 92/PHDLG, 

30 Example 3 

Analysis of the binding properties of protein LG 

Western Blot 

35 

Protein G (the ClDC2-f ragment) , protein L (four B- 
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10 



domains) and protein LG were isolated with SDS-PAGE (1C% 
acrylamide concentration) . The isolated proteins were 
transfered to nitrocellulose membranes in three similar 
copies (triplicate) . Each of these membranes was incu- 
bated with radioactively marked proteins (20 ng/ml: one 
of the membrane-copies was incubated with human poly- 
clonal IgG, another with hiiman IgG Fc-fragment and the 
third with isolated human IgG xchains. Non-bound radio- 
actively marked proteins were rinsed off and all filters 
were then exposed to X-ray film. 



i 



15 



20 



25 



Slot-blot 

Human polyclonal Ig-preparations and Ig-fragments were 
applied with the aid of a slot-blot appliances on nitro- 
cellulose filters in given quantities (see Figure 10) on 
three similar copies. Each of these membranes was incu- 
bated with radioactively marked proteins (20 ng/ml) . One 
of the membrane copies was incubated with protein LG, 
another with protein L and the third with protein G. 
Non-bound radioactively marked proteins were rinsed-off 
and all filters were then exposed to X-ray film. 

The results are shown in Figures 9 and 10, 

Other binding experiments have been carried out, with 
the following results: 
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Binding of the proteins G, h and LG to immunoglobulins. 



Binding protein: 



K 



LG 



Immunoglobulin 



•k 

Polyclonal IgG 
IgG subclasses 
IgG 

IgG^ 
IgG fragment 

♦ 

Fc 

F(ab')2* 
kappa 

lambda 

other Ig-classes 

IgH 

IgA 

IgE 

IgD 



67 (10) 

2.0 
3.1 

4.7 



+ 
+ 



+ 6.0 (0.5) 

0.4 (0.2) + 



9.0 



1.5 



11.6 
10.4 



20 



+ 
-»- 



+ 



other Species: 
Polyclonal 
Monkey + 
Rabbit IgG 
IgG-Fc 

IgG-F(ab')2 
Mouse 
Rat 
Goat 



70 
3.0 
0.44 

41 
1.5 

14 



0.074 



2.6 
0.39 



+ 
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TZ^BLE (cont'd.) 



Binding of the proteins L and LG to immunoglobulins. 



Binding protein: G K 



L K 



LG K 



5^ 



Immunoglobul in 



Bovine 
Horse 

Guinea Pig 

Sheep 

Dog 

Pig 

Hamster 

Cat 

Hen 

Monclonals 

Mouse 

IgG^ 



IgG^ 



+ 

+ 



+ 



IgG 
IgG 
IgG. 
IgM" 
IgA 
Rat 
IgG 
IgG 
IgG 



2a 
2b 



2a 
2b 
2c 



+ 



+ 
+ 



W 

i 
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-X * 

K = affinity constant (M ). The nuaerals within parenthesis 
a 

disclose the affinity of a recombinant protein G comprised of two 
IgG-binding domains. weak bond to lambda chains exists. 
Binding to PI and PLC depends on the type of light chain of Ig. 

It will thus be seen that the synthesized hybrid protein LG has a 
broad binding activity/specificity. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: HighTech Receptro AB 

(B) STREET: c/o Active, Skeppsbron 2 

(C) CITY: MALMO 

(E) COUNTRY: SWEDEN 

(F) POSTAL CODE (ZIP) : 211 20 

(G) TELEPHONE: 040/35 07 00 

(H) TELEFAX: 040/ 23 74 05 

(I) TELEX: 32637 Active S 

(ii) TITLE OF INVENTION: Hybridprotein 
(ill) NUMBER OF SEQUENCES: 1 

(iv) COMPUTER READABLE FORM: " 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: SE PCT/SE93/ 00375 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: SE 92013 31-7 

(B) FILING DATE: 2 8 -APR- 199 2 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 aunino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 
15 10 15 

Glu Glu Glu Val Thr He Lys Ala Asn Leu He Phe Ala Asn Gly Ser 
20 25 30 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 45 
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Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn He Lys Phe Ala Gly 
^5 70 75 80 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 
85 90 95 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 110 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn He Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro 

150 155 160 

Lys Glu Glu Val Thr He Lys Ala Asn Leu He Tyr Ala Asp Gly Lys 
165 170 175 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn He Lys Phe Ala Gly 
210 215 220 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 
225 230 235 240 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
245 250 255 

Thr Phe Ala Glu Ala Thr Ala Glu \la Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 280 , 285 

Thr He Asn He Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu 
290 295 300 

Glu 
305 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GCGGTAGAAA ATAAAGAAGA AACACCAGAA ACACCAGAAA CTGATTCAGA 50 

AGAAGAAGTA ACAATCAAAG CTAACCTAAT CTTTGCAAAT GGAAGCACAC 100 

AAACTGCAGA ATTCAAAGGA ACATTTGAAA AAGCAACATC AGAAGCTTAT 150 

GCGTaTGCAG ATACTTTGAA GAAAGACAAT GGAGAATATA CTGTAGATGT 200 

TGCAGATAAA GGTTATACTT TAAATATTAA ATTTGCTGGA AAAGAAAAAA 250 

CACCAGAAGA ACCAAAAGAA GAAGTTACTA TTAAAGCAAA CTTAATCTAT 3 00 

GCAGATGGAA AAACACAAAC AGCAGAATTC AAAGGAACAT TTGAAGAAGC 350 

AACAGCAGAA GCATACAGAT ATGCAGATGC ATTAAAGAAG GACAATGGAG 400 

AATATACAGT AGACGTTGCA GATAAAGGTT ATACTTTAAA TATTAAATTT 450 

GCTGGAAAAG AAAAAACACC AGAAGAACCA AAAGAAGAAG TTACTATTAA 500 

AGCAAACTTA ATCTATGCAG ATGGAAAAAC ACxiAACAGCA GAATTCAAAG 550 

GAACATTTGA AGAAGCAACA GCAGAAGCAT ACAGATATGC TGACTTATTA 600 

GCAAAAGAAA ATGGTAAATA TACAGTAGAC GTTGCAGATA AAGGTTATAC 650 

TTTAAATATT AAATTTGCTG GAAAAGAAAA AACACCAGAA GAACCAAAAG 700 

AAGAAGTTAC TATTAAAGCA AACTTAATCT ATGCAGATGG AAAAACTCAA 750 

ACAGCAGAGT TCAAAGGAAC ATTTGCAGAA GCAACAGCAG AAGCATACAG 800 

ATACGCTGAC TTATTAGCAA AAGAAAATGG TAAATATACA GCAGACTTAG 850 

AAGATGGTGG ATACACTATT AATATTAGAT TTGCAGGTAA GAAAGTTGAC 900 



GAAAAACCAG AAGAATAATA a 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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Glu Glu Glu Val Thr He Lys Ala Asn Leu He Phe Ala Asn Gly Ser 
20 25 30 



Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 



Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 
225 230 235 240 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
245 250 255 

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDLG, DSM 7055 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 1^ 

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 
15 10 15 



Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 45 



Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn He Lys Phe Ala Gly i>: 
65 70 75 80 



Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 
85 90 95 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 110 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn He Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro ^ 
145 150 155 160 - 

Lys Glu Glu Val Thr He Lys Ala Asn Leu He Tyr Ala Asp Gly Lys y 
165 170 175 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu V: 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

\' 

Val Asp Val Ala Asp Lys Gly Tyr Thr I^u Asn He Lys Phe Ala Gly !> 
210 215 220 ^ 



wo 93/22342 

Ala Lys Glu 
275 

Thr lie Asn 
290 

Glu Pro Met 

305 

Gly Glu Thr 

Phe Lys Gin 

Asp Asp Ala 
355 

Asp Ala Ser 
370 

Asn Gly Lys 
385 

Glu Thr Ala 
Asp Gly Val 
Glu Met 



39 



Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp 
280 285 

He Arg Phe Ala Gly Lys Lys Val Asp Glu 
295 300 

Asp Thr Tyr Lys Leu He Leu Asn Gly Lys 
310 315 

Thr Thr Glu Ala Val Asp Ala Ala Thr Ala 
325 330 

Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu 
340 345 

Thr Lys Thr Phe Thr Val Thr Glu Lys Pro 
360 365 

Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys 
375 380 



Thr Leu Lys Gly Glu Thr 
390 

Glu Lys Ala Phe Lys Gin 
405 

Trp Thr Tyr Asp Asp Ala 
420 425 



Thr Thr 

395 



Lys Ala 
Asn Asp 
Thr Lys Thr Phe 



Tyr Ala 
410 



PCr/SE93/00375 
Gly Gly Tyr 
Lys Pro Glu 



Thr Leu Lys 
320 

Glu Lys Val 
335 

Trp Thr Tyr 
350 

Glu Val He 



Leu Val He 



Val Asp Ala 
400 

Asn Gly Val 
415 

Thr Val Thr 
430 



(2) INFORMATION FOR SEQ ID NO: 4: ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichis coli L392/pHDLG, DSM 7055 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCGGTAGAAA ATAAAGAAGA AACACCAGAA ACACCAGAAA CTGATTCAGA 50 

AGAAGAAGTA ACAATCAAAG CTAACCTAAT CTTTGCAAAT GGAAGCACAC 100 

AAACTGCAGA ATTCAAAGGA ACATTTGAAA AAGCAACATC AGAAGCTTAT 150 

GCGTATGCAG ATACTTTGAA GAAAGACAAT GGAGAATATA CTGTAGATGT 200 
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■M 



m 



TGCAGATAAA 


GGTTATACTT 


TAAATATTAA 


ATTTGCTGGA 


AAAGAAAAAA 


250 


CACCAGAAGA 


ACCAAAAGAA 


GAAGTTACTA 


TTAAAGCAAA 


CTTAATCTAT 


300 


GCAGATGGAA 


AAACACAAAC 


AGCAGAATTC 


AAAGGAACAT 


TTGAAGAAGC 


350 


AACAGCAGAA 


GCATACAGAT 


ATGCAGATGC 


ATTAAAGAAG 


GACAATGGAG 


400 


AATATACAGT 


AGACGTTGCA 


GATAAAGGTT 


ATACTTTAAA 


TATTAAATTT 


450 


GCTGGAAAAG 


AAAAAACACC 


AGAAGAACCA 


AAAGAAGAAG 


TTACTATTAA 


500 


AGCAAACTTA 


ATCTATGCAG 


ATGGAAAAAC 


ACAAACAGCA 


GAATTCAAAG 


550 


GAACATTTGA 


AGAAGCAACA 


GCAGAAGCAT 


ACAGATATGC 


TGACTTATTA 


600 


GCAAAAGAAA 


ATGGTAAATA 


TACAGTAGAC 


GTTGCAGATA 


AAGGTTATAC 


650 


TTTAAATATT 


AAATTTGCTG 


GAAAAGAAAA 


AACAGCAGAA 


GAACCAAAAG 


700 


AAGAAGTTAC 


TATTAAAGCA 


AACTTAATCT 


ATGCAGATGG 


AAAAACTCAA 


750 


ACAGCAGAGT 


TCAAAGGAAC 


ATTTGCAGAA 


GCAACAGCAG 


AAGCATACAG 


800 


ATACGCTGAC 


TTATTAGCAA 


AAGAAAATGG 


TAAATATACA 


GCAGACTTAG 


850 


AAGATGGTGG 


ATACACTATT 


AATATTAGAT 


TTGCAGGTAA 


GAAAGTTGAC 


900 


GAAAAACCAG 


AAGAACCCAT 


GGACACTTAC 


AAATTAATCC 


TTAATGGTAA 


950 


AACATTGAAA 


GGCGAAACAA 


CTACTGAAGC 


TGTTGATGCT 


GCTACTGCAG 


1000 


AAAAAGTCTT 


CAAACAATAC 


GCTAACGACA 


ACGGTGTTGA 


CGGTGAATGG 


1050 


ACTTACGACG 


ATGCGACTAA 


GACCTTTACA 


GTTACTGAAA 


AACCAGAAGT 


1100 


GATCGATGCG 


TCTGAATTAA 


CACCAGCCGT 


GACAACTTAC 


AAACTTGTTA 


1150 


TTAATGGTAA 


AACATTGAAA 


GGCGAAACAA 


CTACTAAAGC 


AGTAGACGCA 


1200 


GAAACTGCAG 


AAAAAGCCTT 


CAAACAATAC 


GCTAACGACA 


ACGGTGTTGA 


1250 


TGGTGTTTGG 


ACTTATGATG 


ATGCGACTAA 


GACCTTTACG 


GTAACTGAAA 


1300 


TGTAATAA 










1308 
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^ Protein L having rhc abiliry to bind to the ligh.r 
chains of imaunoglobulins, characterized 
in that the protein L has the folloving amino acid 
sequence: 

31 

Aia VaZ GIu ksr*ilym Ciu Glu Thr Pro Clu Thr Pro Clu Tftr Xsp Ser 
1 ; 5 IC 15 

Clu aiu Clu Val Thr lie Ly« Ala Asn Leu lie Phe Xla X«n Gly Ser 
20 25 30 

Thr Gin Thr Ala Ciu Phe Lys Gly Thr Ph« Glu Lys Ala Ttir Ser Clu 
35 ' <0 ' 45 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lya Lys Asp A»n Gly Glu Tyz Thr 
50 55 60 

Vel A»p Vel Ala A«p Ly« Gly Tyr Ttxr Leu A«n He Lye Phe Ala Giy 

€5 22 ^0 75 80 

,Ly« Glu Ly» Thr Pro Glu Glu Pro LyK Glu Glu Val Thr He Lye Ala 
as 90 95 

Asn L«u lie Tyr Aia Acp Giy lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 ItO 

Thr Phe Clu Glu Ala Thr Ala Glu Ala Tvr hrrj Tyr Ala Asp Ala Leu 
115 120 * 125 

Lya Ly« Asp kmn Cly Clu Tyr Thr Val A«p Val Ala Aep Lys cly Tyr 

130 125 140 

B3 

Thr L«u A»n lie Lye Phe Ala Gly Lys Clu Lye Tftr Pro Clu Glu Pre 
14 5 ' 150 155 160 

Lys Glu Glu Val Thr He Lya Ala Asn Leu He Tyr Ala Asp Giy Lys 
165 170 175 

Thr Gin Thr Ala Glu Phe Lye Gly Thr Phe Glu Glu Ala Thr Ala Ciu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lya Glu A«n Gly Lye Tyr Thr 

195 -200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr L«u Aen He Lye Phe Aia Gly 
73^ 215 ?.20 



AMENDED SHEET 



.Va^ - L 

->s Clu Lys Tftr ?rc G. • gJ.. Fro Lys Gi-. Val He Us Ala 



Asn Uu He Tyr Ala Asp Cly Lys Thr Gir, Thr Ala Clu Phe Lys Cly 



280 — 35 285 



Thr lie Asn lie Arg Phe Ala GlyjLys Lys Val Acp Glu Lys Pro Giu 
Gl-J 



295 . 300 



30 



2iC 



2S0 255 V 



Thr Ph. Ala Glu Ala Thr Ala ciu Ala Tyr Arg Tyr Ala Asp X*u Leu ^- 

26S 270 r" 

a: a Lys Clu Asn Cly Lys Tyr Thr Ma Asp Leu Glu Asp Gly Gly Tvr ^ 

275 5fln^- -flc * f-;. 



an. variants, .ubf rag^enrs. .ul^.iples or fixtures rh. ^ 
^o»a..e 3X-35 i^ving the sa.e Ending prope^^Is ^ 



1?- 

20 ir codes ■F^>- r-y. - * = « - 1 2 e d in that 5; 

. codes ror rhe protein according rc Clai. i and --as ^ 

.ha fo._ovxng nucleotide sequence: " - 

=CG CIA GAA ;^ ^ 

'^^'^ GAA ACT GAT TCIl s 

..C A^ ,~ AAC C7A ATC TTT GCA AAX GGA AGC - ^ 



c« .c. ^ rrr ^ c„ 

'cr T« === AM «c «i ,M T*. 

cr. ^ ^ MT .1, M. ^ 

«c ..c ^ ^ ^ I 

- c... ^ ^ ■ 

"■^O tJ<z GAC AAT GGA GAA -IT i.-, , V^' 

CAC CTT GCA GAT AAA GGT TAT .J2 J^C 

i.A AAT ATT XXX _ tA"' 

^ — ==A ^ GAA AAA ACA CCA =AA GAA CCA .80 i^-^ 

.-^^ GAA GAA GTT A^T t^' 
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10 



15 



ACA CAA ACA GCA GAA A,AlA G cA ACA TTT GXA GA-A 3CA nCA SCA 3AA bTt 

GCA TAC AGA TAT OCT GAC TTA TTA GCA AAA GAA AAT GGT AAA TAT ACA d2c 

GTA GAC GTT GCA GAT AAA GGT TAT ACT TTA AAI ATT AAA TTT GCT CGA 572 

AAA GAA AAA ACA CCA GAA GAA CCA AAA GAA GAA CTT ACT ATt AAA GCA '22 

A-AC TTA ATC TAT GCA GAT GCA AAA ACT CAA ACA GCA GAG TTC AAA GGA /6S 

ACA TTT GCA GAA GCA ACA GCA GAA GCA TAC AGA TAC CCT GAC TTA TTA Sl£ 

GCA AAA GAA AAT GGT AAA TAT ACA GCA GAC TTA GAA GAT GGT GGi TAC 5St 

ACT A— .\Ar ATT AGA TTT GCA GGT AAG AAA GTT GAC GAA AAA CCA GAA 912 



3. A hybrid prarein, characterized m 
-that i- includes one or more of th« 3l-B5-ao3aains ac- 
cording CO Claim 1 whicn h-ind to the light chair^s ir, 
irununoglobulxns of all classes, and domains which bind 
ro heavy chains in imiaujioglobuliri G, 

4. A hybrid protei>i according to ClaXTn 3, char- 
acterized in that the domains which bind ro 
heavy chains in inmunoglobuiin G aire chosen from aaong 
the Cl- and C2 -domains in protein G or from anong any 
other functionally similar proteins which iTind no heavy 
chains in immunoglobulin G, and variants, subfragaents , 
multiples or jnixrures thereof having the saa^ binding 
properties, 

5. A hybrid protein according to claim 4, char- 
acterized in that the hybrid prc^t^in has the 
following amino acid sequence; 
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ALa V«l 'sn lY« sip Clu Thr Pro Olo Thr ?rs Glu T.^.r Asp Ser 

- * w 

t;':u aiu GI\: VaX I^r :i« Lvs Ala Asn I^u lie ?he A-la Asn Gly Ser 
13 25 30 

Ztr Sir. inr Ale Glu ?>2e Lys Gly Tnr ?h« Glu Lys Ala Thr Ser GI*^ 
35 40 45 

Ala Tyr Aia Tyr Ala Asp Thr Leu I.ys Lys Asp Asn Gly Glu Tyr Thr 
50 * * 55 * 60 

Val Asp Vsl Ala Asp Lvs Gly Tyr Ttz Leu Asn lie I-y* ^-Y 
65 70 75 

Lvs Glu Lvs rnr ?rc Glu Glu Pro ■i.ys Glu Glu 'VaI Thr He Lys Aid 
55 9C '5 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 ' * 105 11^ 

Thr r-he Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Xla Asp Ala :>eu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala As? Lys Gly T^-r 
13C 135 140 

Thr L«u Asn lie Lvs ?he Ala Glv Lys Glu Lys Thr Pro Glu Glu ?rc 

145 ' 150 ' 155 

LvS Glu Glu Val Thr lie Lvs Ala Asn Leu Ila Tyr Ala Asp Gly Lys 

' 1€5 ' :?0 

T^.r Gin Thr Ala Glu ?he Lys Gly Tnr Phe Glu Glu Ala Tnr Ala Glu 
180 IBS 

Ala Tyr Arg Tyr Ala As? Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lit Lys ?iie Ala Gly 

210 215 220 

Lvs Glu Lys Tnr ?ro Glu Glu Pro 'Lys Glu Glu Val Thr lie Lys Ala 
225 230 235 240 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu ?he Lys Gly 
245 250 255 



20 
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rr.r Phe Ala Glu Ala rr.r A.!- GIu Ala Krg Tyr Xla Xsts Leu Leu 

^^'^ 265 270 

Ala Lys Glu Asn Gly Lys ^/r TTir Ala Asp Leu Glu Asp GIv cly r>'- 

-'5 28C • 



Ihr ^Ile Asr* He Arg p.^,e Al 



;iy Lys Lys Vil Asp clu Lys Pro Glu 



2'^' 300 

aiu Prp Ker Asp TKr Tyr tys l^:; lie lau Asr. r^ly lys Tr.r l^w lys 

^10 315 ' 310 

Gly Glu r.^r rr.r T.nr GIj Ala Val Asp Ala Ala Thr Ala Clu Lys val 
-25 335 

?ne Lys Gin Tyr Ala Asr. Asp Asn Gly Val Asp Gly Glu Trp T-r Tyr 
3^0 345 

Asp Asp Ala Thr Lys Ttiz Phe Thr Val r^.r Glu Lys Pro Glu Vai lie 
555 3eo 365 

Asp Ala Ser Glu Leu Tnr Pro Aia Val rhr Thr Tyr Lys Leu Vai lie 

310 

Asr. Gly Lys Thr >eu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp Ala 

190 295 40C 

Glu Thr Ala Glu Lys Ala Phe Lys Gin T>*r Ala Asn Asp Asn Gly Val 
405 43^5 

Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lvs Thr Phe Thr Va.! Thr 

425 

Glu Mer 



and variants, ^ul^f raga«tts , auatiple« or mixpures of rhe 
domains Bl-BS having tilxe sMse binding properries, 

S. DNA^ftatju^e, cfaaracrerized in rhat 
ir codafi for a prorein according ro ciaiiL 5 and hafi the 
following nucleotide sequence: 
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GCG 


w . A 


. 3AA 


AAT 


AAA 


G AA 


GAA 


ACA 


CCA 


GAA 




CCA 


GAA 


ACT 


CAT 


, wA 


-c 




GAA 


f~ \ * 




ACA 


A- C 


AAA. 


GCT 




CTA 


ATC 


TCT 


GCA 


AAT 




AGC 




Alt 


CAA 


Ar? 


3CA 




TTC 


AAA 


GwA 


ACA 


TTT 




AAA 


GCA 


ACA 


TCA 


GAA 






TAX 




* AT 


GCA 


GAT 


ACT 


* * w* 


AAG 


AAA 


GAC 


AAT 


GGA 


GAA 


TAT 


ACT 




GTA 


GAT 


GTT 


GCA 


GAT 


AAA 


GGT 




ACT 


TTA 


AAT 


ATT 


AAA 


TTT 


GCT 


GGA 




AAA 


OAA 


AAA 


ACA 


CCA 


GAA 


GAA 


CCA 


AAA 


GAA 


GAA 


GTT 


ACT 


ATT 


AAA 


CCA 


25S 


AAC 


TTA 


ATC 


TA J. 


GCA 


GAT 


GGA 


AAA 


ACA 


CAA 


ACA 


GCA 


GAA 


TTC 


AAA 


GGA 




ACA 


TTT 


GAA 


CAA 


GCA 


ACA 


GCA 




GCA 


TAG 


ACA 


TAT 


GCX 


GAT 


GCA 






AAG 


AAG 


GAC 


AAT 


GGA 


GAA 


TAT 


ACA 


GTA 


GAC 


GTT 


GCA 


GAT 


AAA 


GGT 


TAT 




ACT 


TTA 


AAT 


ATT 


AAA 




CCT 


GGA 


AAA 


GAA 


AAA 


ACA 


CCA 


GAA 


CAA 


CCA 




AAA 


CAA 


GAA 


GTT 




^ m*T- 


AAA 


^*wA 


% « ^ 




ATC 


TAT 


GCA 


GAT 


» 


A^-A 




ACA 


WhA 


ACA 




GAA 


' 1" ' ■r' 


AAA 


w*^A 


ACA 




GAA 


GAA 


CCA 


ACA 


GCA 


GAA. 





GCT GGA 67 



3D 



GCA TAC AGA TAT GCT GAC TTA TTA GCA AAA GAA AAT GCT AAA 
GTA GAC GTT GCA GAT AAA GCT TAT ACT TTA AAT ATT AAA TTT 
AAA GAA AAA ACA CCA GA^. GAA CCA. AAA. GAA GAA GTT ACT ATT AAA GCA ~ZZ 
>AC TTA ATC TAT GCA GAT GGA AAA ACT CAA ACA GCA GAG CTC AAA GGA ~63 
ACA TTT GCA GAA GCA ACA GCA GAA GCA TAC AGA TAC GCT GAC TTA TTA 5 "-6 
GCA AAA GAA AAT GGT AAA TAT ACA GCA GAC TTA GAA GAT CCT GC^ TAC rt- 
ACT ATT AAT ATT AGA TTT GCA GGT AAG AAA GTT GAC GAA AAA CCA GAA 
GAA CCC ATG GAC ACT TAC AAA TTA ATC CTT AAT GGT AAA ACA TTG AAA 
GGC GAA ACA ACT ACT GAA GCT GTT GAT GCT CCT ACT GCA GAA AAA GTC 
TTC AAA CAA TAC GCT AAC GAC AAC GGT CTT GAC CCT GAaTcC ACT TAC 
GAC GAT GCG ACT AAG ACC TTT ACA CTT ACT GAA AAA CCA GAA CTG ATC 
GAT GCG TCT GAA TTA ACA CCA GCC GTG ACA ACT TAC AAA CTT CTT ATT 
AAT GGT AAA ACA TTG AAA GGC GAA ACA ACT ACT AAA G-CA CTA GAC GCA 
GAA ACT GCA GAA AAA GCC TTC AAA CA.A TAC GCT AAC GAC AAC GGT GTT M^iB 
GAT GGT GTT TCG ACT TAT GAT GAT GCG ACT AAC ACC TTT ACG CTA ACT :296 
GAA ATG TAATAA . 32g 



" ■ t 7 



AMENDEiLSHEET 



47 



7. DKA-sequance, character! 'zed in that 
it codes fcr a protein according tc Claims 3, 4 and 5. 

S- A plasmid vector^ characterized in 
5 that it includes a DKA-seguence according to any one of 
Claiics _2_a.nd._6r3_> preferably the vector pHDLG or pHDL 
according to Fig. 3 or 4* 

9. A host cell, characterized in thar it 
10 is transformed with the hyiDrid plasmid according to 

Clai^i 9 in particular a host which belongs tc the 
species £. coli > pjirt ocularly coli I*E392, or Bacillus 
subtilis, Saccarogvces cerevisiae . preferably Id. Kef. 
DSSM coli I:h:3S2 pHDL and £. coli LE292/pHi:iX^ ras- 
15 pectively. 

10 . A merhod for producing a protein according to 
Claims 1 and 3^-5^ characterized by cultiv- 
ating a host cell according to Claim 10 under suitable 

2C conditions; accuiaulering the protein in. the culture or 

lysing the cells and exTiractiing rhe prot-ein t^heref roin, 

11. A reagent kit for binding, separating and identi- 
fying immunoglobuiins, characterized in 

2 5 than it includes a pro'::ein according to any one of 

Claims 1 and 3-5. 



12. A composition, characterized in that 
ir includes a protein according to any one of Claims i 
3 0 and 3-5, and oprionally additives or carriers. 



13. A phamaceurical compos ition^ c h a r a c - 
t e r 2 e d in rhat ir includes a protein according 
to any one of Clains 1 and 3-5, and optionally a pharaa- 
35 cautlcally acceptaJble carrier ~cr extender. 
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ABSTRACT OF THE DISCLOSUKE 



The invention relates to sequences of protein L which bind 
to light chains of immunoglobulins. The invention also relates 
to hybrid proteins thereof which are able to bind to both light 
and heavy chains of immunoglobulin G, in particular protein IrG, 
The invention also relates to DNA-sequences which code for the 
proteins, vectors which include such DNA-sequences, host cells 
which have been transformed with the vectors, methods for 
producing the proteins, reagent appliances for separation and 
identification of immunoglobulins, compositions and 
pharmaceutical compositions and pharmaceutical compositions which 
contain the proteins. 
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posn of paytni reduced feci under section 41 (a) and (b) of Title 33. Untied Staie* Code, to the Patent and Trademark 
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i J applicalion serial no. ^ Hied Octct>er 26, 1994 . 

I J patent no. . issued „ '. 
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37 CFR 1.9 id) or a nonprofit organization under 37 CFR 1.9 (e). 
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and belief are believed to be true; and further that these statements were made with the knowledge that willful false siaiemenis 
and the like so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of the United States 
Code, and that such willful fats* statements may jeopardize the vahdity of the application, any patent issuing thereon, or 
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MULTIPLE CLONING SEQUEN- 



INDEPENDENT 
TRANSCRIPTIONS 
TERMINATING 
SEQUENCES 



SIGNAL PEPTIDE FOR THE 
SEQUENCE FROM OnPA 



RBS = RIBOSOMAL AGGAGG 
BINDING SEQUENCE 

Pr = "RIGHT" PROMOTOR FROM COLIPHAGtA 

CI857 THE GENE FOR A HEAT-SENEITIVE REPRES- 
SOR-PROTEIN FROM COLIPHAGE A 

FIG 1 PLASMA dHD 389. THE RIBOSOMAL 

BINDING-SEQUENCE (E^mSIZED WITH 
A FULL LINE). THE SEQUENCE FOR SIGNAL PEPTIDE FROM otoA 
(FROM E.CGli) (DOTTED LINE) AND RECOGNITION SEQUEMTE FOR 
SEVERAL RESTRICTION ENZYMES ARE SHOWN. 
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FIG. 3 SCHEMATIC OVERALL VEIW OF THE PRODUCTION 
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